CODING SEQUENCES OF THE HUMAN BRCA1 GENE

FIELD OF THE INVENTION

This invention relates to a gene which has been associated with breast and
ovarian cancer where the gene is found to be mutated. More specifically, this
invention relates to the three coding sequences of the BRCA1 gene, BRCA1(omi1),
BRCA1(omi2), and BRCA1(omi3), isolated from human subjects.

BACKGROUND OF THE INVENTION 

It has been estimated that about 5-10% of breast cancer is inherited. Rowell,
S., et al., American Journal of Human Genetics 55:861-865 (1994). Located on
chromosome 17, BRCA1 is the first gene identified as conferring increased risk
for breast and ovarian cancer. Miki et al., Science 266:66-71 (1994). Mutations
in this "tumor suppressor" gene are thought to account for roughly 45% of
inherited breast cancer and 80-90% of families with increased risk of early
onset breast and ovarian cancer. Easton et al., American Journal of Human
Genetics 52:678-701 (1993).

Locating one or more mutations in the BRCA1 region of chromosome 17
provides a promising approach to reducing the high incidence and mortality
associated with breast and ovarian cancer through the early detection of women
at high risk. These women, once identified, can be targeted for more aggressive
prevention programs. Screening is carried out by a variety of methods which
include karyotyping, probe binding and DNA sequencing.

In DNA sequencing technology, genomic DNA is extracted from whole
blood and the coding sequences of the BRCA1 gene are amplified. The coding
sequences may be sequenced completely and the results compared to the
DNA sequence of the gene. Alternatively, the coding sequence of the sample
gene may be compared to a panel of known mutations before completely
sequencing the gene and comparing it to a normal sequence of the gene.
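The comparison step described above can be illustrated with a minimal sketch. It assumes the sample and the reference have already been aligned and are of equal length, so only base substitutions are reported; the function name and the toy sequences are hypothetical and are not taken from the sequence listing.

```python
def sequence_differences(reference, sample):
    """Return (1-based position, reference base, sample base) per mismatch.

    Assumption: both sequences are pre-aligned and of equal length, so
    insertions and deletions are outside the scope of this sketch.
    """
    if len(reference) != len(sample):
        raise ValueError("this sketch only handles equal-length sequences")
    return [
        (index + 1, ref_base, sample_base)
        for index, (ref_base, sample_base) in enumerate(
            zip(reference.upper(), sample.upper())
        )
        if ref_base != sample_base
    ]


# Toy sequences for illustration only.
toy_reference = "ATGGATTTATCTGCT"
toy_sample = "ATGGACTTATCTGCT"
print(sequence_differences(toy_reference, toy_sample))
```

Each reported difference would then need to be interpreted as either a polymorphism or a mutation, which is the problem the remainder of this description addresses.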
If a mutation in the BRCA1 coding sequence is found, it may be possible to
provide the individual with increased expression of the gene through gene
transfer therapy. It has been demonstrated that the gene transfer of the BRCA1
coding sequence into cancer cells inhibits their growth and reduces
tumorigenesis of human cancer cells in nude mice. Jeffrey Holt and his
colleagues conclude that the product of BRCA1 expression is a secreted tumor
growth inhibitor, making BRCA1 an ideal gene for gene therapy studies.
Transduction of only a moderate percentage of tumor cells apparently produces
enough growth inhibitor to inhibit all tumor cells. Arteaga, CL, and JT Holt,
Cancer Research 56:1098-1103 (1996); Holt, JT et al., Nature Genetics
12:298-302 (1996).

The observation of Holt et al. that the BRCA1 growth inhibitor is a
secreted protein suggests that the growth inhibitor could be injected into
the area of the tumor for tumor suppression.

The BRCA1 gene is divided into 24 separate exons. Exons 1 and 4 are
noncoding, in that they are not part of the final functional BRCA1 protein
product. The BRCA1 coding sequence spans roughly 5600 base pairs (bp). Each
exon consists of 200-400 bp, except for exon 11 which contains about 3600 bp. To
sequence the coding sequence of the BRCA1 gene, each exon is amplified
separately and the resulting PCR products are sequenced in the forward and
reverse directions. Because exon 11 is so large, we have divided it into twelve
overlapping PCR fragments of roughly 350 bp each (segments "A" through "L" of
BRCA1 exon 11).
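As an illustration of the arithmetic only, the sketch below tiles a region of about 3600 bp with twelve evenly spaced fragments of 350 bp. The even spacing and the function name are assumptions made for this example; the actual fragment boundaries are set by the primers used and are not reproduced here.

```python
import string


def tile_region(region_length, n_fragments, fragment_length):
    """Return 1-based inclusive (start, end) pairs that cover the region.

    Fragments are evenly spaced; neighbours overlap (or at least abut)
    whenever n_fragments * fragment_length >= region_length.
    """
    if n_fragments < 2:
        raise ValueError("need at least two fragments")
    if fragment_length > region_length:
        raise ValueError("fragment is longer than the region")
    if n_fragments * fragment_length < region_length:
        raise ValueError("fragments are too short to cover the region")
    span = region_length - fragment_length
    tiles = []
    for i in range(n_fragments):
        # Integer arithmetic keeps the last fragment flush with the region end.
        start = (i * span) // (n_fragments - 1) + 1
        tiles.append((start, start + fragment_length - 1))
    return tiles


# Approximate figures from the text: exon 11 ~3600 bp, twelve ~350 bp pieces.
for label, (start, end) in zip(string.ascii_uppercase, tile_region(3600, 12, 350)):
    print(f"segment {label}: {start}-{end}")
```

With these figures, neighbouring fragments share roughly 55 bp, which is what allows the forward and reverse reads of adjacent segments to be joined.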

Many mutations and polymorphisms have already been reported in the
BRCA1 gene. A world wide web site has been built to facilitate the detection and
characterization of alterations in breast cancer susceptibility genes. Such
mutations in BRCA1 can be accessed through the Breast Cancer Information Core
at: http://www.nchgr.nih.gov/dir/lab_transfer/bic. This data site became
publicly available on November 1, 1995. Friend, S. et al., Nature Genetics
11:238 (1995).
The genetics of Breast/Ovarian Cancer Syndrome is autosomal dominant
with reduced penetrance. In simple terms, this means that the syndrome runs
through families such that: both sexes can be carriers (only women get the
disease, but men can pass it on); all generations will likely have breast
cancer, ovarian cancer, or both, sometimes in the same individual; and
occasionally women carriers either die young, before they have had time to
manifest disease (and yet their offspring get it), or never develop breast or
ovarian cancer and die of old age (the latter are said to show "reduced
penetrance" because they never develop cancer). Pedigree analysis and genetic
counseling are essential to the proper workup of a family prior to any lab work.

Until now, only a single coding sequence for the BRCA1 gene has been
available for comparison to patient samples. That sequence is available as
GenBank Accession Number U14680. There is a need in the art, therefore, to
have available a coding sequence which is the BRCA1 coding sequence found in
the majority of the population, a "consensus coding sequence", BRCA1(omi1),
SEQ. ID. NO.: 1. A consensus coding sequence will make it possible for true
mutations to be easily identified or differentiated from polymorphisms.
Identification of mutations of the BRCA1 gene and protein would allow more
widespread diagnostic screening for hereditary breast and ovarian cancer than is
currently possible. Two additional coding sequences have been isolated and
characterized. The BRCA1(omi2), SEQ. ID. NO.: 3, and BRCA1(omi3), SEQ. ID.
NO.: 5, coding sequences also have utility in diagnosis, gene therapy and in
making therapeutic BRCA1 protein.

A coding sequence of the BRCA1 gene which occurs most commonly in
the human gene pool is provided. The most commonly occurring coding
sequence more accurately reflects the most likely sequence to be found in a
subject. Use of the coding sequence BRCA1(omi1), SEQ. ID. NO.: 1, rather than
the previously published BRCA1 sequence, will reduce the likelihood of
misinterpreting a "sequence variation" found in the population (i.e., a
polymorphism) as a pathologic "mutation" (i.e., one that causes disease in the
individual or puts the individual at a high risk of developing the disease).
With large interest in breast cancer predisposition testing, misinterpretation
is particularly worrisome. People who already have breast cancer are asking the
clinical question: "Is my disease caused by a heritable genetic mutation?" The
relatives of those with breast cancer are asking the question: "Am I also a
carrier of the mutation my relative has? Thus, is my risk increased, and should
I undergo a more aggressive surveillance program?"

SUMMARY OF THE INVENTION 

The present invention is based on the isolation of three coding sequences
of the BRCA1 gene found in human individuals.

It is an object of the invention to provide the most commonly occurring
coding sequence of the BRCA1 gene.

It is another object of this invention to provide two other coding
sequences of the BRCA1 gene.

It is another object of the invention to provide three protein sequences
coded for by three of the coding sequences of the BRCA1 gene.

It is another object of the invention to provide a list of the codon pairs
which occur at each of seven polymorphic points on the BRCA1 gene.

It is another object of the invention to provide the rates of occurrence for
the codons.

It is another object of the invention to provide a method wherein BRCA1,
or parts thereof, is amplified with one or more oligonucleotide primers.

It is another object of this invention to provide a method of identifying
individuals who carry no mutation(s) of the BRCA1 coding sequence and
therefore have no increased genetic susceptibility to breast or ovarian cancer
based on their BRCA1 genes.

It is another object of this invention to provide a method of identifying a
mutation leading to an increased genetic susceptibility to breast or ovarian
cancer.

There is a need in the art for a sequence of the BRCA1 gene and for the
protein sequence of BRCA1 as well as for an accurate list of codons which occur
at polymorphic points on a sequence.
A person skilled in the art of genetic susceptibility testing will find the
present invention useful for:

a) identifying individuals having a BRCA1 gene with no coding
mutations, who therefore cannot be said to have an increased
genetic susceptibility to breast or ovarian cancer from their BRCA1
genes;

b) avoiding misinterpretation of polymorphisms found in the BRCA1
gene;

c) determining the presence of a previously unknown mutation in the
BRCA1 gene;

d) identifying a mutation which increases the genetic susceptibility to
breast or ovarian cancer;

e) probing a human sample of the BRCA1 gene;

f) performing gene therapy; and

g) making a functioning tumor growth inhibitor protein coded for
by one of the BRCA1(omi) genes.

BRIEF DESCRIPTION OF THE FIGURE

As shown in FIGURE 1, the alternative alleles at polymorphic (non-mutation
causing variations) sites along a chromosome can be represented as a "haplotype"
within a gene such as BRCA1. The BRCA1(omi1) haplotype is shown in Figure 1
with dark shading (encompassing the alternative alleles found at nucleotide sites
2201, 2430, 2731, 3232, 3667, 4427, and 4956). For comparison, the haplotype that is
in GenBank is shown with no shading. As can be seen from the figure, the
common "consensus" haplotype is found intact in five separate chromosomes
labeled with the OMI symbol (numbers 1-5 from left to right). Two additional
haplotypes (BRCA1(omi2) and BRCA1(omi3)) are represented with mixed dark and
light shading (numbers 7 and 9 from left to right). In total, 7 of 10 haplotypes
along the BRCA1 gene are unique.
DETAILED DESCRIPTION OF THE INVENTION

DEFINITIONS

The following definitions are provided for the purpose of understanding this
invention.

"Breast and Ovarian cancer" is understood by those skilled in the art to
include breast and ovarian cancer in women and also breast and prostate cancer
in men. BRCA1 is associated with genetic susceptibility to inherited breast and
ovarian cancer in women and also breast and prostate cancer in men. Therefore,
claims in this document which recite breast and/or ovarian cancer refer to
breast, ovarian and prostate cancers in men and women.

"Coding sequence" or "DNA coding sequence" refers to those portions of a
gene which, taken together, code for a peptide (protein), or which nucleic acid
itself has function.

"Protein" or "peptide" refers to a sequence of amino acids which has
function.

"BRCA1(omi)" refers collectively to the "BRCA1(omi1)", "BRCA1(omi2)" and
"BRCA1(omi3)" coding sequences.

"BRCA1(omi1)" refers to SEQ. ID. NO.: 1, a coding sequence for the BRCA1
gene. The coding sequence was found by end to end sequencing of BRCA1 alleles
from individuals randomly drawn from a Caucasian population found to have
no family history of breast or ovarian cancer. The sequenced gene was found not
to contain any mutations. BRCA1(omi1) was determined to be a consensus
sequence by calculating the frequency with which the coding sequence occurred
among the sample alleles sequenced.

"BRCA1(omi2)" and "BRCA1(omi3)" refer to SEQ. ID. NO.: 3 and SEQ. ID. NO.: 5,
respectively. They are two additional coding sequences for the BRCA1 gene
which were also isolated from individuals randomly drawn from a Caucasian
population found to have no family history of breast or ovarian cancer.

"Primer" as used herein refers to a sequence comprising about 20 or more
nucleotides of the BRCA1 gene.

"Genetic susceptibility" refers to the susceptibility to breast or ovarian
cancer due to the presence of a mutation in the BRCA1 gene.

A "target polynucleotide" refers to the nucleic acid sequence of interest, e.g.,
the BRCA1 encoding polynucleotide. Other primers which can be used for
primer hybridization will be known or readily ascertainable to those of skill in
the art.

"Consensus" means the most commonly occurring in the population.
"Consensus genomic sequence" means the allele of the target gene which
occurs with the greatest frequency in a population of individuals having no
family history of disease associated with the target gene.
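The frequency-based definition of "consensus" can be sketched in a few lines. This is an illustration only: the haplotype labels "H1" through "H6" are hypothetical placeholders, not the sequenced alleles, and ties are resolved in favour of whichever allele was counted first.

```python
from collections import Counter


def consensus_allele(alleles):
    """Return (most frequent allele, its frequency) for a list of alleles."""
    if not alleles:
        raise ValueError("no alleles supplied")
    counts = Counter(alleles)
    allele, count = counts.most_common(1)[0]
    return allele, count / len(alleles)


# Hypothetical sample of ten chromosomes, five of which share one haplotype.
sampled = ["H1"] * 5 + ["H2", "H3", "H4", "H5", "H6"]
print(consensus_allele(sampled))
```

In practice each allele would be a full coding sequence, or the set of codons observed at the polymorphic sites, rather than a short label.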

"Substantially complementary to" refers to probe or primer sequences
which hybridize to the sequences provided under stringent conditions and/or
sequences having sufficient homology with BRCA1 sequences, such that the
allele specific oligonucleotide probe or primers hybridize to the BRCA1
sequences to which they are complementary.

"Haplotype" refers to a series of alleles within a gene on a chromosome.

"Isolated" as used herein refers to substantially free of other nucleic acids,
proteins, lipids, carbohydrates or other materials with which they may be
associated. Such association is typically either in cellular material or in a
synthesis medium.
"Mutation" refers to a base change or a gain or loss of base pair(s) in a DNA
sequence, which results in a DNA sequence which codes for a non-functioning
protein or a protein with substantially reduced or altered function.

"Polymorphism" refers to a base change which is not associated with
known pathology.

"Tumor growth inhibitor protein" refers to the protein coded for by the
BRCA1 gene. The functional protein is thought to suppress breast and ovarian
tumor growth.

The invention in several of its embodiments includes:

1. An isolated consensus DNA sequence of the BRCA1 coding sequence as set
forth in SEQ. ID. NO.: 1.

2. A consensus protein sequence of the BRCA1 protein as set forth in
SEQ. ID. NO.: 2.

3. An isolated coding sequence of the BRCA1 gene as set forth in
SEQ. ID. NO.: 3.

4. A protein sequence of the BRCA1 protein as set forth in
SEQ. ID. NO.: 4.

5. An isolated coding sequence of the BRCA1 gene as set forth in
SEQ. ID. NO.: 5.

6. A protein sequence of the BRCA1 protein as set forth in SEQ. ID. NO.: 6.

7. A BRCA1 gene with a BRCA1 coding sequence not associated with
breast or ovarian cancer which comprises an alternative pair of codons, AGC
and AGT, which occur at position 2201 at frequencies of about 35-45%, and
from about 55-65%, respectively.

8. A BRCA1 gene according to Claim 7 wherein AGC occurs at a
frequency of about 40%.

9. A set of at least two alternative codon pairs which occur at
polymorphic positions in a BRCA1 gene with a BRCA1 coding sequence not
associated with breast or ovarian cancer, wherein codon pairs are selected
from the group consisting of:

• AGC and AGT at position 2201;
• TTG and CTG at position 2430;
• CCG and CTG at position 2731;
• GAA and GGA at position 3232;
• AAA and AGA at position 3667;
• TCT and TCC at position 4427; and
• AGT and GGT at position 4956.

10. A set of at least two alternative codon pairs according to claim 9,
wherein the codon pairs occur in the following frequencies, respectively, in a
population of individuals free of disease:

• at position 2201, AGC and AGT occur at frequencies from about
35-45%, and from about 55-65%, respectively;

• at position 2430, TTG and CTG occur at frequencies from about
35-45%, and from about 55-65%, respectively;

• at position 2731, CCG and CTG occur at frequencies from about
25-35%, and from about 65-75%, respectively;

• at position 3232, GAA and GGA occur at frequencies from about
35-45%, and from about 55-65%, respectively;

• at position 3667, AAA and AGA occur at frequencies from about
35-45%, and from about 55-65%, respectively;

• at position 4427, TCT and TCC occur at frequencies from about
45-55%, and from about 45-55%, respectively; and

• at position 4956, AGT and GGT occur at frequencies from about
35-45%, and from about 55-65%, respectively.



11. A set according to Claim 10 which is at least three codon pairs.

12. A set according to Claim 10 which is at least four codon pairs.

13. A set according to Claim 10 which is at least five codon pairs.

14. A set according to Claim 10 which is at least six codon pairs.

15. A set according to Claim 10 which is at least seven codon pairs.



16. A method of identifying individuals having a BRCA1 gene with a
BRCA1 coding sequence not associated with disease, comprising:

(a) amplifying a DNA fragment of an individual's BRCA1 coding
sequence using an oligonucleotide primer which specifically
hybridizes to sequences within the gene;

(b) sequencing said amplified DNA fragment by dideoxy sequencing;

(c) repeating steps (a) and (b) until said individual's BRCA1 coding
sequence is completely sequenced;

(d) comparing the sequence of said amplified DNA fragment to a BRCA1
DNA sequence, SEQ. ID. NO.: 1, SEQ. ID. NO.: 3, or SEQ. ID. NO.: 5;

(e) determining the presence or absence of each of the following
polymorphic variations in said individual's BRCA1 coding sequence:

• AGC and AGT at position 2201,
• TTG and CTG at position 2430,
• CCG and CTG at position 2731,
• GAA and GGA at position 3232,
• AAA and AGA at position 3667,
• TCT and TCC at position 4427, and
• AGT and GGT at position 4956;

(f) determining any sequence differences between said individual's
BRCA1 coding sequences and SEQ. ID. NO.: 1, SEQ. ID. NO.: 3, or
SEQ. ID. NO.: 5, wherein the presence of said polymorphic variations
and the absence of a variation outside of positions 2201, 2430, 2731,
3232, 3667, 4427, and 4956 is correlated with an absence of increased
genetic susceptibility to breast or ovarian cancer resulting from a
BRCA1 mutation in the BRCA1 coding sequence.

17. A method of claim 16 wherein codon variations occur at the
following frequencies, respectively, in a population of individuals free of
disease:

• at position 2201, AGC and AGT occur at frequencies from about
35-45%, and from about 55-65%, respectively;

• at position 2430, TTG and CTG occur at frequencies from about
35-45%, and from about 55-65%, respectively;

• at position 2731, CCG and CTG occur at frequencies from about
25-35%, and from about 65-75%, respectively;

• at position 3232, GAA and GGA occur at frequencies from about
35-45%, and from about 55-65%, respectively;

• at position 3667, AAA and AGA occur at frequencies from about
35-45%, and from about 55-65%, respectively;

• at position 4427, TCT and TCC occur at frequencies from about
45-55%, and from about 45-55%, respectively; and

• at position 4956, AGT and GGT occur at frequencies from about
35-45%, and from about 55-65%, respectively.

18. A method according to claim 16 wherein said oligonucleotide primer is
labeled with a radiolabel, a fluorescent label, a bioluminescent label, a
chemiluminescent label, or an enzyme label.

19. A method of detecting an increased genetic susceptibility to breast and
ovarian cancer in an individual resulting from the presence of a mutation in
the BRCA1 coding sequence, comprising:

(a) amplifying a DNA fragment of an individual's BRCA1 coding
sequence using an oligonucleotide primer which specifically
hybridizes to sequences within the gene;

(b) sequencing said amplified DNA fragment by dideoxy sequencing;

(c) repeating steps (a) and (b) until said individual's BRCA1 coding
sequence is completely sequenced;

(d) comparing the sequence of said amplified DNA fragment to a
BRCA1(omi) DNA sequence, SEQ. ID. NO.: 1, SEQ. ID. NO.: 3, or
SEQ. ID. NO.: 5;

(e) determining any sequence differences between said individual's
BRCA1 coding sequences and SEQ. ID. NO.: 1, SEQ. ID. NO.: 3, or
SEQ. ID. NO.: 5, to determine the presence or absence of base changes
in said individual's BRCA1 coding sequence, wherein a base change
which is not any one of the following:

• AGC and AGT at position 2201,
• TTG and CTG at position 2430,
• CCG and CTG at position 2731,
• GAA and GGA at position 3232,
• AAA and AGA at position 3667,
• TCT and TCC at position 4427, and
• AGT and GGT at position 4956

is correlated with the potential of increased genetic susceptibility to
breast or ovarian cancer resulting from a BRCA1 mutation in the BRCA1
coding sequence.



20. A method of claim 19 wherein codon variations occur at the following
frequencies, respectively, in a population free of disease:

• at position 2201, AGC and AGT occur at frequencies of about
40%, and from about 55-65%, respectively;

• at position 2430, TTG and CTG occur at frequencies from about
35-45%, and from about 55-65%, respectively;

• at position 2731, CCG and CTG occur at frequencies from about
25-35%, and from about 65-75%, respectively;

• at position 3232, GAA and GGA occur at frequencies from about
35-45%, and from about 55-65%, respectively;

• at position 3667, AAA and AGA occur at frequencies from about
35-45%, and from about 55-65%, respectively;

• at position 4427, TCT and TCC occur at frequencies from about
45-55%, and from about 45-55%, respectively; and

• at position 4956, AGT and GGT occur at frequencies from about
35-45%, and from about 55-65%, respectively.

21. A method according to claim 19 wherein said oligonucleotide primer is
labeled with a radiolabel, a fluorescent label, a bioluminescent label, a
chemiluminescent label, or an enzyme label.

22. A set of codon pairs, which occur at polymorphic positions in a
BRCA1 gene with a BRCA1 coding sequence according to Claim 1, wherein
said set of codon pairs is:

• AGC and AGT at position 2201;
• TTG and CTG at position 2430;
• CCG and CTG at position 2731;
• GAA and GGA at position 3232;
• AAA and AGA at position 3667;
• TCT and TCC at position 4427; and
• AGT and GGT at position 4956.

23. A set of at least two alternative codon pairs according to claim 22
wherein the set of at least two alternative codon pairs occurs at the following
frequencies:

• at position 2201, AGC and AGT occur at frequencies of about
40%, and from about 55-65%, respectively;

• at position 2430, TTG and CTG occur at frequencies from about
35-45%, and from about 55-65%, respectively;

• at position 2731, CCG and CTG occur at frequencies from about
25-35%, and from about 65-75%, respectively;

• at position 3232, GAA and GGA occur at frequencies from about
35-45%, and from about 55-65%, respectively;

• at position 3667, AAA and AGA occur at frequencies from about
35-45%, and from about 55-65%, respectively;

• at position 4427, TCT and TCC occur at frequencies from about
45-55%, and from about 45-55%, respectively; and

• at position 4956, AGT and GGT occur at frequencies from about
35-45%, and from about 55-65%, respectively.

24. A BRCA1 coding sequence according to claim 1 wherein the codon
pairs occur at the following frequencies:

• at position 2201, AGC and AGT occur at frequencies of about
40%, and from about 55-65%, respectively;

• at position 2430, TTG and CTG occur at frequencies from about
35-45%, and from about 55-65%, respectively;

• at position 2731, CCG and CTG occur at frequencies from about
25-35%, and from about 65-75%, respectively;

• at position 3232, GAA and GGA occur at frequencies from about
35-45%, and from about 55-65%, respectively;

• at position 3667, AAA and AGA occur at frequencies from about
35-45%, and from about 55-65%, respectively;

• at position 4427, TCT and TCC occur at frequencies from about
45-55%, and from about 45-55%, respectively; and

• at position 4956, AGT and GGT occur at frequencies from about
35-45%, and from about 55-65%, respectively.

25. A method of determining the consensus genomic sequence or consensus
coding sequence for a target gene, comprising:

a) screening a number of individuals in a population for a family history
which indicates inheritance of normal alleles for a target gene;

b) isolating at least one allele of the target gene from individuals found to
have a family history which indicates inheritance of normal alleles for a target
gene;

c) sequencing each allele;

d) comparing the nucleic acid sequence of the genomic sequence or of the
coding sequence of each allele of the target gene to determine similarities and
differences in the nucleic acid sequence; and

e) determining which allele of the target gene occurs with the greatest
frequency.

26. A method of performing gene therapy, comprising:

a) transfecting cancer cells in vivo with an effective amount of a
vector transformed with a BRCA1 coding sequence of SEQ.
ID. NO.: 1, SEQ. ID. NO.: 3, or SEQ. ID. NO.: 5;

b) allowing the cells to take up the vector; and

c) measuring a reduction in tumor growth.

27. A method of performing protein therapy, comprising:

a) injecting into a patient an effective amount of BRCA1 tumor
growth inhibiting protein of SEQ. ID. NO.: 2, SEQ. ID. NO.: 4,
or SEQ. ID. NO.: 6;

b) allowing the cells to take up the protein; and

c) measuring a reduction in tumor growth.
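The interpretation step shared by embodiments 16 and 19 can be sketched as a lookup against the seven polymorphic positions listed above. This is a simplified illustration: the dictionary and function names are hypothetical, each observed difference is assumed to be supplied as a (position, codon) pair, and any change not in the table is flagged only as a possible mutation that would need further evaluation.

```python
# Codon pairs at the seven polymorphic positions listed in the embodiments.
POLYMORPHIC_CODONS = {
    2201: {"AGC", "AGT"},
    2430: {"TTG", "CTG"},
    2731: {"CCG", "CTG"},
    3232: {"GAA", "GGA"},
    3667: {"AAA", "AGA"},
    4427: {"TCT", "TCC"},
    4956: {"AGT", "GGT"},
}


def classify_change(position, codon):
    """Label an observed codon as a listed polymorphism or a possible mutation."""
    allowed = POLYMORPHIC_CODONS.get(position)
    if allowed is not None and codon.upper() in allowed:
        return "polymorphism"
    return "possible mutation"


def has_possible_mutation(observed_changes):
    """Return True if any (position, codon) pair falls outside the table."""
    return any(
        classify_change(position, codon) == "possible mutation"
        for position, codon in observed_changes
    )


# Hypothetical observations for one individual.
print(classify_change(2201, "AGT"))
print(has_possible_mutation([(2201, "AGT"), (4427, "TCC")]))
```

Keeping the table separate from the classification logic means the same function could be reused if further polymorphic sites were characterized.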

SEQUENCING 

Any nucleic acid specimen, in purified or non-purified form, can be
utilized as the starting nucleic acid or acids, provided it contains, or is
suspected of containing, the specific nucleic acid sequence containing a
polymorphic locus. Thus, the process may amplify, for example, DNA or RNA,
including messenger RNA, wherein DNA or RNA may be single stranded or double
stranded. In the event that RNA is to be used as a template, enzymes and/or
conditions optimal for reverse transcribing the template to DNA would be
utilized. In addition, a DNA-RNA hybrid which contains one strand of each may
be utilized. A mixture of nucleic acids may also be employed, or the nucleic
acids produced in a previous amplification reaction herein, using the same or
different primers, may be so utilized. See TABLE II. The specific nucleic acid
sequence to be amplified, i.e., the polymorphic locus, may be a fraction of a
larger molecule or can be present initially as a discrete molecule, so that the
specific sequence constitutes the entire nucleic acid. It is not necessary that
the sequence to be amplified be present initially in a pure form; it may be a
minor fraction of a complex mixture, such as contained in whole human DNA.

DNA utilized herein may be extracted from a body sample, such as blood,
tissue material and the like by a variety of techniques such as that described by
Maniatis, et al. in Molecular Cloning: A Laboratory Manual (Cold Spring Harbor,
NY, p 280-281, 1982). If the extracted sample is impure, it may be treated before
amplification with an amount of a reagent effective to open the cells, or animal
cell membranes of the sample, and to expose and/or separate the strand(s) of the
nucleic acid(s). This lysing and nucleic acid denaturing step to expose and
separate the strands will allow amplification to occur much more readily.

The deoxyribonucleotide triphosphates dATP, dCTP, dGTP, and dTTP are
added to the synthesis mixture, either separately or together with the primers, in
adequate amounts and the resulting solution is heated to about 90°-100°C for
about 1 to 10 minutes, preferably from 1 to 4 minutes. After this heating period,
the solution is allowed to cool, which is preferable for the primer hybridization.
To the cooled mixture is added an appropriate agent for effecting the primer
extension reaction (called herein "agent for polymerization"), and the reaction is
allowed to occur under conditions known in the art. The agent for
polymerization may also be added together with the other reagents if it is heat
stable. This synthesis (or amplification) reaction may occur at room temperature
up to a temperature above which the agent for polymerization no longer
functions. Thus, for example, if DNA polymerase is used as the agent, the
temperature is generally no greater than about 40°C. Most conveniently the
reaction occurs at room temperature.

The primers used to carry out this invention embrace oligonucleotides of
sufficient length and appropriate sequence to provide initiation of
polymerization. Environmental conditions conducive to synthesis include the
presence of nucleoside triphosphates and an agent for polymerization, such as
DNA polymerase, and a suitable temperature and pH. The primer is preferably
single stranded for maximum efficiency in amplification, but may be double
stranded. If double stranded, the primer is first treated to separate its strands
before being used to prepare extension products. The primer must be sufficiently
long to prime the synthesis of extension products in the presence of the inducing
agent for polymerization. The exact length of primer will depend on many
factors, including temperature, buffer, and nucleotide composition. The
oligonucleotide primer typically contains 12-20 or more nucleotides, although it
may contain fewer nucleotides.
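To illustrate how primer length and nucleotide composition interact with temperature, the sketch below applies the Wallace rule, a common rule of thumb for short oligonucleotides (roughly the 12-20 nucleotide range mentioned above). The rule is not taken from this document and is brought in here as an assumption; the example primer is hypothetical.

```python
def wallace_tm(primer):
    """Estimate melting temperature in degrees C as 2*(A+T) + 4*(G+C).

    Assumption: the Wallace rule, a rough approximation for short
    oligonucleotides; it ignores buffer and salt conditions.
    """
    bases = primer.upper()
    if not bases or set(bases) - set("ACGT"):
        raise ValueError("primer must be a non-empty string of A, C, G, T")
    at_count = bases.count("A") + bases.count("T")
    gc_count = bases.count("G") + bases.count("C")
    return 2 * at_count + 4 * gc_count


# Hypothetical 20-nucleotide primer with 12 G/C and 8 A/T bases.
print(wallace_tm("GGGGCCCCAAAATTTTGGCC"))
```

A longer or more G/C-rich primer gives a higher estimate, which is one reason the appropriate primer length depends on the reaction temperature.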

Primers used to carry out this invention are designed to be substantially
complementary to each strand of the genomic locus to be amplified. This means
that the primers must be sufficiently complementary to hybridize with their
respective strands under conditions which allow the agent for polymerization to
perform. In other words, the primers should have sufficient complementarity
with the 5' and 3' sequences flanking the mutation to hybridize therewith and
permit amplification of the genomic locus.
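The relationship between a primer pair and the two strands of the locus can be made concrete with a small sketch. It assumes perfectly complementary primers taken directly from the ends of a known template, which is stricter than the "substantially complementary" standard described above; the function names and the toy template are hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(sequence):
    """Return the reverse complement of a DNA sequence."""
    return sequence.translate(_COMPLEMENT)[::-1]


def primer_pair(template, length=20):
    """Return (forward, reverse) primers flanking the template.

    The forward primer copies the 5' end of the given strand; the reverse
    primer is the reverse complement of its 3' end, so it anneals to the
    given strand and is extended back toward the forward primer.
    """
    if len(template) < 2 * length:
        raise ValueError("template too short for two non-overlapping primers")
    forward = template[:length]
    reverse = reverse_complement(template[-length:])
    return forward, reverse


# Toy template for illustration only.
print(primer_pair("AAAACCCCGGGGTTTT", length=4))
```

Both primers are written 5' to 3', which is the convention used when ordering or tabulating oligonucleotides.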
Oligonucleotide primers of the invention are employed in the
amplification process which is an enzymatic chain reaction that produces
exponential quantities of polymorphic locus relative to the number of reaction
steps involved. Typically, one primer is complementary to the negative (-)
strand of the polymorphic locus and the other is complementary to the positive
(+) strand. Annealing the primers to denatured nucleic acid followed by
extension with an enzyme, such as the large fragment of DNA polymerase I
(Klenow) and nucleotides, results in newly synthesized + and - strands
containing the target polymorphic locus sequence. Because these newly
synthesized sequences are also templates, repeated cycles of denaturing, primer
annealing, and extension result in exponential production of the region (i.e., the
target polymorphic locus sequence) defined by the primers. The product of the
chain reaction is a discrete nucleic acid duplex with termini corresponding to the
ends of the specific primers employed.
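The exponential character of the chain reaction can be illustrated numerically. The sketch below uses the idealized model in which each cycle multiplies the number of target copies by (1 + efficiency); the efficiency parameter is an assumption introduced for illustration, with 1.0 corresponding to perfect doubling in every cycle.

```python
def amplified_copies(initial_copies, cycles, efficiency=1.0):
    """Idealized copy number after a number of amplification cycles.

    Assumption: every cycle multiplies the copy number by (1 + efficiency),
    where efficiency lies between 0 (no amplification) and 1 (doubling).
    """
    if not 0.0 <= efficiency <= 1.0:
        raise ValueError("efficiency must be between 0 and 1")
    if cycles < 0:
        raise ValueError("cycles must be non-negative")
    return initial_copies * (1 + efficiency) ** cycles


# One starting template, 30 cycles, perfect doubling versus 90% efficiency.
print(amplified_copies(1, 30))
print(amplified_copies(1, 30, efficiency=0.9))
```

Real reactions plateau as reagents are consumed, so this model describes only the early, exponential cycles.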

The oligonucleotide primers of the invention may be prepared using any
suitable method, such as conventional phosphotriester and phosphodiester
methods or automated embodiments thereof. In one such automated
embodiment, diethylphosphoramidites are used as starting materials and may be
synthesized as described by Beaucage, et al., Tetrahedron Letters, 22:1859-1862,
1981. One method for synthesizing oligonucleotides on a modified solid support
is described in U.S. Patent No. 4,458,066.

The agent for polymerization may be any compound or system which will
function to accomplish the synthesis of primer extension products, including
enzymes. Suitable enzymes for this purpose include, for example, E. coli DNA
polymerase I, Klenow fragment of E. coli DNA polymerase, polymerase muteins,
reverse transcriptase, and other enzymes, including heat-stable enzymes (i.e., those
enzymes which perform primer extension after being subjected to temperatures
sufficiently elevated to cause denaturation), such as Taq polymerase. A suitable
enzyme will facilitate combination of the nucleotides in the proper manner to
form the primer extension products which are complementary to each
polymorphic locus nucleic acid strand. Generally, the synthesis will be initiated
at the 3' end of each primer and proceed in the 5' direction along the template
strand, until synthesis terminates, producing molecules of different lengths.

The newly synthesized strand and its complementary nucleic acid strand
will form a double-stranded molecule under hybridizing conditions described
above and this hybrid is used in subsequent steps of the process. In the next step,
the newly synthesized double-stranded molecule is subjected to denaturing
conditions using any of the procedures described above to provide single-
stranded molecules.
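
Because each newly synthesized strand serves as a template in the next round,
the copy number grows geometrically with the number of cycles. The short Python
sketch below illustrates that arithmetic. The `efficiency` parameter (the
fraction of templates copied per cycle) is an assumption added for
illustration; the text itself only states that accumulation is exponential.

```python
# Minimal sketch (illustrative only): idealized growth of target copies over cycles.
def amplified_copies(initial_copies, cycles, efficiency=1.0):
    """Return the copy number after `cycles` rounds of amplification.

    Each cycle multiplies the copy number by (1 + efficiency); efficiency=1.0
    is perfect doubling, an upper bound that real reactions do not reach.
    """
    if not 0.0 <= efficiency <= 1.0:
        raise ValueError("efficiency must be between 0 and 1")
    return initial_copies * (1.0 + efficiency) ** cycles


for n in (10, 20, 35):
    print(n, amplified_copies(1, n))
```

With perfect doubling, ten cycles give 2^10 = 1024 copies per starting
molecule; lower efficiencies reduce the base of the exponent but not the
exponential character of the growth.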

The steps of denaturing, annealing, and extension product synthesis can be
repeated as often as needed to amplify the target polymorphic locus nucleic acid
sequence to the extent necessary for detection. The amount of the specific nucleic
acid sequence produced will accumulate in an exponential fashion.
Amplification is described in PCR: A Practical Approach, IRL Press, Eds. M. J.
McPherson, P. Quirke, and G. R. Taylor, 1992.

The amplification products may be detected by Southern blot analysis,
without using radioactive probes. In such a process, for example, a small sample 
of DNA containing a very low level of the nucleic acid sequence of the 
polymorphic locus is amplified, and analyzed via a Southern blotting technique 
or similarly, using dot blot analysis. The use of non-radioactive probes or labels 
is facilitated by the high level of the amplified signal. Alternatively, probes used 
to detect the amplified products can be directly or indirectly detectably labeled, for 
example, with a radioisotope, a fluorescent compound, a bioluminescent 
compound, a chemiluminescent compound, a metal chelator or an enzyme. 
Those of ordinary skill in the art will know of other suitable labels for binding to 
the probe, or will be able to ascertain such, using routine experimentation. 

Sequences amplified by the methods of the invention can be further
evaluated, detected, cloned, sequenced, and the like, either in solution or after
binding to a solid support, by any method usually applied to the detection of a
specific DNA sequence such as PCR, oligomer restriction (Saiki, et al.,
Bio/Technology, 3:1008-1012, 1985), allele-specific oligonucleotide (ASO) probe
analysis (Conner, et al., Proc. Natl. Acad. Sci. U.S.A., 80:278, 1983),
oligonucleotide ligation assays (OLAs) (Landgren, et al., Science, 241:1007, 1988),
and the like. Molecular techniques for DNA analysis have been reviewed
(Landgren, et al., Science, 242:229-237, 1988).

Preferably, the method of amplifying is by PCR, as described herein and as
is commonly used by those of ordinary skill in the art. Alternative methods of
amplification have been described and can also be employed as long as the
BRCA1 locus amplified by PCR using primers of the invention is similarly
amplified by the alternative means. Such alternative amplification systems
include but are not limited to self-sustained sequence replication, which begins
with a short sequence of RNA of interest and a T7 promoter. Reverse
transcriptase copies the RNA into cDNA and degrades the RNA, followed by
reverse transcriptase polymerizing a second strand of DNA. Another nucleic
acid amplification technique is nucleic acid sequence-based amplification
(NASBA) which uses reverse transcription and T7 RNA polymerase and
incorporates two primers to target its cycling scheme. NASBA can begin with
either DNA or RNA and finish with either, and amplifies to 10^8 copies within 60
to 90 minutes. Alternatively, nucleic acid can be amplified by ligation activated
transcription (LAT). LAT works from a single-stranded template with a single
primer that is partially single-stranded and partially double-stranded.
Amplification is initiated by ligating a cDNA to the promoter oligonucleotide
and within a few hours, amplification is 10^8 to 10^9 fold. Another amplification
system useful in the method of the invention is the Qβ Replicase System. The
Qβ replicase system can be utilized by attaching an RNA sequence called MDV-1
to RNA complementary to a DNA sequence of interest. Upon mixing with a
sample, the hybrid RNA finds its complement among the specimen's mRNAs
and binds, activating the replicase to copy the tag-along sequence of interest.
Another nucleic acid amplification technique, ligase chain reaction (LCR), works
by using two differently labeled halves of a sequence of interest which are
covalently bonded by ligase in the presence of the contiguous sequence in a
sample, forming a new target. The repair chain reaction (RCR) nucleic acid
amplification technique uses two complementary and target-specific
oligonucleotide probe pairs, thermostable polymerase and ligase, and DNA
nucleotides to geometrically amplify targeted sequences. A 2-base gap separates
the oligonucleotide probe pairs, and the RCR fills and joins the gap, mimicking
DNA repair. Nucleic acid amplification by strand displacement activation (SDA)
utilizes a short primer containing a recognition site for HincII with a short
overhang on the 5' end which binds to target DNA. A DNA polymerase fills in
the part of the primer opposite the overhang with sulfur-containing adenine
analogs. HincII is added but only cuts the unmodified DNA strand. A DNA
polymerase that lacks 5' exonuclease activity enters at the site of the nick and
begins to polymerize, displacing the initial primer strand downstream and
building a new one which serves as more primer. SDA produces greater than
10^7-fold amplification in 2 hours at 37°C. Unlike PCR and LCR, SDA does not
require instrumented temperature cycling.

Another method is a process for amplifying nucleic acid sequences from a
DNA or RNA template which may be purified or may exist in a mixture of
nucleic acids. The resulting nucleic acid sequences may be exact copies of the
template, or may be modified. The process has advantages over PCR in that it
increases the fidelity of copying a specific nucleic acid sequence, and it allows one
to more efficiently detect a particular point mutation in a single assay. A target
nucleic acid is amplified enzymatically while avoiding strand displacement.
Three primers are used. A first primer is complementary to the first end of the
target. A second primer is complementary to the second end of the target. A
third primer is similar to the first end of the target and is
substantially complementary to at least a portion of the first primer, such that
when the third primer is hybridized to the first primer, the position of the third
primer complementary to the base at the 5' end of the first primer contains a
modification which substantially avoids strand displacement. This method is
detailed in U.S. Patent 5,593,840 to Bhatnagar et al., 1997. Although PCR is the
preferred method of amplification of the invention, these other methods can also
be used to amplify the BRCA1 locus as described in the method of the invention.

The BRCA1(omi) DNA coding sequences were obtained by end to end
sequencing of the BRCA1 alleles of five subjects in the manner described above
followed by analysis of the data obtained. The data obtained provided us with the
opportunity to evaluate seven previously published polymorphisms and to
affirm or correct, where necessary, the frequency of occurrence of alternative
codons.

GENE THERAPY 

The coding sequences can be used for gene therapy.
A variety of methods are known for gene transfer, any of which might be
available for use.

Direct injection of Recombinant DNA in vivo 

1. Direct injection of "naked" DNA directly with a syringe and needle into a
specific tissue, infused through a vascular bed, or transferred through a catheter
into endothelial cells.

2. Direct injection of DNA that is contained in artificially generated lipid
vesicles.

3. Direct injection of DNA conjugated to a targeting structure, such as an
antibody.

4. Direct injection by particle bombardment, where the DNA is coated onto
gold particles and shot into the cells.

Human Artificial Chromosomes

This novel gene delivery approach involves the use of human chromosomes
that have been stripped down to contain only the essential components for
replication and the genes desired for transfer.

Receptor-Mediated Gene Transfer

DNA is linked to a targeting molecule that will bind to specific cell-surface
receptors, inducing endocytosis and transfer of the DNA into mammalian cells.
One such technique uses poly-L-lysine to link asialoglycoprotein to DNA. An
adenovirus is also added to the complex to disrupt the lysosomes and thus allow
the DNA to avoid degradation and move to the nucleus. Infusion of these
particles intravenously has resulted in gene transfer into hepatocytes.

RECOMBINANT VIRUS VECTORS 

Several vectors are used in gene therapy. Among them are the Moloney Murine
Leukemia Virus (MoMLV) vectors, the adenovirus vectors, the adeno-associated
virus (AAV) vectors, the herpes simplex virus (HSV) vectors, the
poxvirus vectors, and human immunodeficiency virus (HIV) vectors.

GENE REPLACEMENT AND REPAIR 

The ideal genetic manipulation for treatment of a genetic disease would be the
actual replacement of the defective gene with a normal copy of the gene.
Homologous recombination is the term used for switching out a section of DNA
and replacing it with a new piece. By this technique, the defective gene can be
replaced with a normal gene which expresses a functioning BRCA1 tumor
growth inhibitor protein.

A complete description of gene therapy can also be found in "Gene Therapy: A
Primer For Physicians," 2d Ed., by Kenneth W. Culver, M.D., Publ. Mary Ann
Liebert Inc. (1996). Two Gene Therapy Protocols for BRCA1 are approved by the
Recombinant DNA Advisory Committee for Jeffrey T. Holt et al. They are listed
as 9602-148 and 9603-149 and are available from the NIH. The isolated BRCA1
gene can be synthesized or constructed from amplification products and inserted
into a vector such as the LXSN vector.

The BRCA1 amino acid and nucleic acid sequence may be used to make
diagnostic probes and antibodies. Labeled diagnostic probes may be used by any
hybridization method to determine the level of BRCA1 protein in serum or lysed
cell suspension of a patient, or solid surface cell sample.

The BRCA1 amino acid sequence may be used to provide a level of
protection for patients against risk of breast or ovarian cancer or to reduce the
size of a tumor. Methods of making and extracting proteins are well known.
Itakura et al., U.S. Patents 4,704,362, 5,221,619, and 5,583,013. BRCA1 has been
shown to be secreted. Jensen, R.A. et al., Nature Genetics 12:303-308 (1996).

EXAMPLE 1

Determination Of The Coding Sequence Of A BRCA1(omi1) Gene From Five
Individuals

MATERIALS AND METHODS

Approximately 150 volunteers were screened in order to identify
individuals with no cancer history in their immediate family (i.e., first and
second degree relatives). Each person was asked to fill out a hereditary cancer
prescreening questionnaire. See TABLE I below. Five of these were randomly
chosen for end-to-end sequencing of their BRCA1 gene. A first degree relative is
a parent, sibling, or offspring. A second degree relative is an aunt, uncle,
grandparent, grandchild, niece, nephew, or half-sibling.

TABLE I

Hereditary Cancer Pre-Screening Questionnaire

Part A: Answer the following questions about your family

1. To your knowledge, has anyone in your family been diagnosed with a very specific
hereditary colon disease called Familial Adenomatous Polyposis (FAP)?

2. To your knowledge, have you or any aunt had breast cancer diagnosed before the age of 35?

3. Have you had Inflammatory Bowel Disease, also called Crohn's Disease or Ulcerative
Colitis, for more than 7 years?

Part B: Refer to the list of cancers below for your responses only to questions in Part B

Bladder Cancer        Lung Cancer           Pancreatic Cancer
Breast Cancer         Gastric Cancer        Prostate Cancer
Colon Cancer          Malignant Melanoma    Renal Cancer
Endometrial Cancer    Ovarian Cancer        Thyroid Cancer

4. Have your mother or father, your sisters or brothers or your children had any of the listed
cancers?

5. Have there been diagnosed in your mother's brothers or sisters, or your mother's parents
more than one of the cancers in the above list?

6. Have there been diagnosed in your father's brothers or sisters, or your father's parents
more than one of the cancers in the above list?

Part C: Refer to the list of relatives below for responses only to questions in Part C

You                         Your mother
Your sisters or brothers    Your mother's sisters or brothers (maternal aunts and uncles)
Your children               Your mother's parents (maternal grandparents)

7. Have there been diagnosed in these relatives 2 or more identical types of cancer?
Do not count "simple" skin cancer, also called basal cell or squamous cell skin cancer.

8. Is there a total of 4 or more of any cancers in the list of relatives above other than
"simple" skin cancers?

Part D: Refer to the list of relatives below for responses only to questions in Part D.

You                         Your father
Your sisters or brothers    Your father's sisters or brothers (paternal aunts and uncles)
Your children               Your father's parents (paternal grandparents)

9. Have there been diagnosed in these relatives 2 or more identical types of cancer?
Do not count "simple" skin cancer, also called basal cell or squamous cell skin cancer.

10. Is there a total of 4 or more of any cancers in the list of relatives above other than "simple"
skin cancers?

© Copyright 1996, OncorMed, Inc.

Genomic DNA was isolated from white blood cells of five subjects selected
from analysis of their answers to the questions above. Dideoxy sequence analysis
was performed following polymerase chain reaction amplification.

All exons of the BRCA1 gene were subjected to direct dideoxy sequence
analysis by asymmetric amplification using the polymerase chain reaction (PCR)
to generate a single stranded product amplified from this DNA sample.
Shuldiner, et al., Handbook of Techniques in Endocrine Research, p. 457-486,
DePablo, F., Scanes, C., eds., Academic Press, Inc., 1993. Fluorescent dye was
attached for automated sequencing using the Taq Dye Terminator® Kit (Perkin-
Elmer cat# 401628). DNA sequencing was performed in both forward and reverse
directions on an Applied Biosystems, Inc. (ABI) automated Model 377®
sequencer. The software used for analysis of the resulting data was Sequence
Navigator® software purchased through ABI.

1. Polymerase Chain Reaction (PCR) Amplification

Genomic DNA (100 nanograms) was extracted from white blood cells of five
subjects. Each of the five samples was sequenced end to end. Each sample was
amplified in a final volume of 25 microliters containing 1 microliter (100
nanograms) genomic DNA, 2.5 microliters 10X PCR buffer (100 mM Tris, pH 8.3,
500 mM KCl, 1.2 mM MgCl2), 2.5 microliters 10X dNTP mix (2 mM each
nucleotide), 2.5 microliters forward primer, 2.5 microliters reverse primer,
1 microliter Taq polymerase (5 units), and 13 microliters of water.
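
As a quick arithmetic check on the reaction mix described above, the following
Python sketch sums the component volumes and works out the diluted
concentrations of the buffer salts and dNTPs. The dictionary layout and
function name are illustrative assumptions; the volumes and stock
concentrations are taken from the paragraph above.

```python
# Minimal sketch (illustrative only): check the 25 microliter PCR mix.
REACTION_UL = {
    "genomic DNA": 1.0,
    "10X PCR buffer": 2.5,
    "10X dNTP mix": 2.5,
    "forward primer": 2.5,
    "reverse primer": 2.5,
    "Taq polymerase": 1.0,
    "water": 13.0,
}
FINAL_VOLUME_UL = 25.0


def final_concentration(stock, volume_ul, total_ul=FINAL_VOLUME_UL):
    """Dilute a stock concentration (any unit) into the final reaction volume."""
    return stock * volume_ul / total_ul


total_ul = sum(REACTION_UL.values())
tris_mM = final_concentration(100.0, REACTION_UL["10X PCR buffer"])
kcl_mM = final_concentration(500.0, REACTION_UL["10X PCR buffer"])
dntp_each_mM = final_concentration(2.0, REACTION_UL["10X dNTP mix"])

print(f"total volume: {total_ul} microliters")
print(f"Tris: {tris_mM} mM, KCl: {kcl_mM} mM, each dNTP: {dntp_each_mM} mM")
```

The component volumes add up to the stated 25 microliters, and the 10X stocks
are diluted tenfold in the final reaction.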

The primers in Table II, below, were used to carry out amplification of the
various sections of the BRCA1 gene samples. The primers were synthesized on
a DNA/RNA Model 394® Synthesizer.

TABLE II

BRCA1 PRIMERS AND SEQUENCING DATA

EXON       PRIMER   SEQUENCE                                           SEQ. ID NO.  MER   Mg++   SIZE

EXON 2     2F       5' [illegible] ATT TTA TAA ACC TTT-3'              7            24    1.6    ~275
           2R       5' TGT CTT TTC TTC CCT AGT ATG T-3'                8            22
EXON 3     3F       5' [illegible] CAC AGC AGA CAT TTA-3'              9            21    1.4    ~375
           3R       5' TTG GAT TTT CGT TCT CAC TTA-3'                  10           21
EXON 5     5F       5' [illegible] TTA [illegible] GCA GTT GTG AG-3'   11           20    1.2    ~275
           5R       5' TTC GTA CTG TCG TTG CTT ?C-3'                   12           20
EXON 6     6/7F     5' ?TT ATT TTA GT?G TCC TTA AAA GG-3'              13           23    1.6    ~250
           6R       5' TTT CAT GGA CAG CAC TTG AGT G-3'                14           22
EXON 7     7F       5' [illegible] AAC AAA GAG CAT ACA TAG GG-3'       15           23    1.6    ~275
           6/7R     5' [illegible] TCA CTC TGT AGA AG-3'               16           20
EXON 8     8F1      5' [illegible] T?T TCA GGA GGA AAA GCA-3'          17           21    1.2    ~270
           8R1      5' ?CT G?C TAG CAC AAA TAC AAA-3'                  18           21
EXON 9     9F       5' CCA CAG TAG ATG CTC AGT ?AA TA-3'               19           23    1.2    ~250
           9R       5' TAG GAA AAT ACC AGC TTC ATA GA-3'               20           23
EXON 10    10F      5' [illegible] TCA ?CT TTC TGT AAT CG-3'           21           20    1.6    ~250
           10R      5' GTA TCT ACC CAC TCT CTT CTT CAG-3'              22           24
EXON 11A   11AF     5' [illegible] C?A AGG TGT ATC A-3'                23           19    1.2    372
           11AR     5' TGT TAT GTT ??C TCC TTG CT-3'                   24           20
EXON 11B   11BF1    5' ?A? TAA A?A CAG AAT GAA TCT A-3'                25           21    1.2    ~400
           11BR1    5' [illegible] ??A ?AA TAT TCA TCT A-3'            26           21
EXON 11C   11CF1    5' TGA TGG GGA ??? TGA ATC AA-3'                   27           20    1.2    ~400
           11CR1    5' TCT ?CT TTC TTG ATA AAA TCC T-3'                28           22
EXON 11D   11DF1    5' AG? GTG CCC TCA CAA ATA AA-3'                   29           20    1.2    ~400
           11DR1    5' TCA AGC GCA TGA ATA TGC CT-3'                   30           20
EXON 11E   11EF     5' GTA TAA GCA ATA T?G AAC TGG A-3'                31           22    1.2    388
           11ER     5' TTA AGT TCA CTG GTA TTT GAA CA-3'               32           23
EXON 11F   11FF     5' GAC AGC GAT ACT TTC CCA GA-3'                   33           20    1.2    382
           11FR     5' T?G AAC AAC CAT GAA TTA GTC-3'                  34           21
EXON 11G   11GF     5' GGA AGT TAG CAC TCT AGG GA-3'                   35           20    1.2    423
           11GR     5' GCA GT? ATA TTA ACT GTC TGT A-3'                36           22
EXON 11H   11HF     5' T?G GTC CTT AAA GAA ACA AAG T-3'                37           22    1.2    366
           11HR     5' TCA GGT GAC ATT GAA TG? TCC-3'                  38           21
EXON 11I   11IF     5' CCA CTT TTT CCC ATC AAG TCA-3'                  39           21    1.2    377
           11IR     5' TCA GGA TGC TTA CAA TTA CTT C-3'                40           21
EXON 11J   11JF     5' CAA AAT TGA ATG CTA TGC TTA GA-3'               41           23    1.2    377
           11JR     5' TCG GTA ACC CT? AGC CAA AT-3'                   42           20
EXON 11K   11KF     5' GCA AAA GCG TCC AGA AAG GA-3'                   43           20    1.2    396
           11KR-1   5' TAT TTG CAG TCA AGT CTT CCA A-3'                44           22
EXON 11L   11LF-1   5' GTA ATA TTG GCA AAG GCA TCT-3'                  45           22    1.2    360
           11LR     5' TAA AAT G?? CTC CCC AAA AGC A-3'                46           22
EXON 12    12F      5' GTC CT? CCA ATG AGA AGA AA-3'                   47           20    1.2    ~300
           12R      5' TGT CAG CAA ACC TAA GAA TGT-3'                  48           21
EXON 13    13F      5' AAT GGA AAG CTT CTC AAA GTA-3'                  49           21    1.2    ~325
           13R      5' ATG TTG GAG CTA GGT CCT TAC-3'                  50           21
EXON 14    14F      5' CTA ACC TGA ATT ATC ACT ATC A-3'                51           22    1.2    ~310
           14R      5' GT? TAT AAA TGC CTG TAT GCA-3'                  52           21
EXON 15    15F      5' T?G CTG CCC AGG AAG TAT G-3'                    53           19    1.2    ~375
           15R      5' AAC CAG AAT ATC TTT ATG TAG GA-3'               54           23
EXON 16    16F      5' AAT TCT TAA CAG AGA CCA GAA C-3'                55           22    1.6    ~550
           16R      5' AAA ACT CTT TCC AGA ATG TTG T-3'                56           22
EXON 17    17F      5' GTG TAG AAC GTG CAG GAT TG-3'                   57           20    1.2    ~275
           17R      5' TCG CCT CAT GTTG GTT TTA-3'                     58           18
EXON 18    18F      5' G?C TCT TTA GCT TCT TAG GAC-3'                  59           21    1.2    ~350
           18R      5' GAG ACC ATT TTC CCA GCA TC-3'                   60           20
EXON 19    19F      5' CTG TCA TTC TTC CTG TGC TC-3'                   61           20    1.2    ~250
           19R      5' CAT TGT TAA GGA AAG T?G TGC-3'                  62           21
EXON 20    20F      5' ATA TGA CGT GTC TGC TCC AC-3'                   63           20    1.2    ~425
           20R      5' ?GG AAT CCA AAT TAC ACA GC-3'                   64           20
EXON 21    21F      5' AAG CTC TTC CTT TTT GAA AGT C-3'                65           22    1.6    ~300
           21R      5' GTA GAG AAA TAG AAT AGC CTC T-3'                66           22
EXON 22    22F      5' TCC CAT TGA GAG GTC TTG CT-3'                   67           20    1.6    ~300
           22R      5' GAG AAG ACT TCT GAG GCT AC-3'                   68           20
EXON 23    23F-1    5' TGA AGT GAC AGT TCC AGT AGT-3'                  69           21    1.2    ~250
           23R-1    5' CAT TTT AGC CAT TCA TTC AAC AA-3'               70           23
EXON 24    24F      5' ATG AAT TGA CAC TAA TCT CTG C-3'                71           22    1.4    ~285
           24R      5' GTA [illegible] AGG ACA GTA GAA GGA-3'          72           21

1 M13 tailed



Thirty-five cycles were performed, each consisting of denaturing (95°C; 30
seconds), annealing (55°C; 1 minute), and extension (72°C; 90 seconds), except during
the first cycle in which the denaturing time was increased to 5 minutes, and during the
last cycle in which the extension time was increased to 5 minutes.
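
The cycling program described above can be written out as a small Python
sketch that totals the programmed hold times. The function name and parameter
names are illustrative assumptions, and the figure excludes the time the
instrument needs to ramp between temperatures, which the text does not give.

```python
# Minimal sketch (illustrative only): total programmed hold time of the cycling profile.
def cycling_seconds(cycles=35, denature=30, anneal=60, extend=90,
                    first_denature=300, last_extend=300):
    """Sum hold times in seconds, with a longer first denaturation and last extension."""
    total = 0
    for cycle in range(1, cycles + 1):
        d = first_denature if cycle == 1 else denature
        e = last_extend if cycle == cycles else extend
        total += d + anneal + e
    return total


seconds = cycling_seconds()
print(f"{seconds} s of programmed holds ({seconds / 60:.0f} min), ramp time not included")
```

Changing `cycles` or any hold time shows how the run length scales with the
program.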
PCR products were purified using Qia-quick® PCR purification kits (Qiagen cat#
28104; Chatsworth, CA). Yield and purity of the PCR product were determined
spectrophotometrically at OD260 on a Beckman DU 650 spectrophotometer.
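
For readers unfamiliar with OD260 readings, the sketch below shows the
conventional conversion for double-stranded DNA, in which one absorbance unit
at 260 nm in a 1 cm path corresponds to roughly 50 micrograms per milliliter.
This conversion factor is standard laboratory practice rather than something
stated in the text, and the function name and example readings are
hypothetical.

```python
# Minimal sketch (illustrative only): estimate dsDNA yield from an OD260 reading.
DSDNA_UG_PER_ML_PER_A260 = 50.0  # conventional factor for double-stranded DNA, 1 cm path


def dsdna_ug_per_ml(a260, dilution_factor=1.0, path_cm=1.0):
    """Convert an A260 reading into an approximate dsDNA concentration (ug/mL)."""
    if path_cm <= 0:
        raise ValueError("path length must be positive")
    return a260 * DSDNA_UG_PER_ML_PER_A260 * dilution_factor / path_cm


# Hypothetical reading: A260 of 0.5 measured on a 20-fold dilution.
print(dsdna_ug_per_ml(0.5, dilution_factor=20))
```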

2. Dideoxy Sequence Analysis 

Fluorescent dye was attached to PCR products for automated sequencing using the Taq
Dye Terminator® Kit (Perkin-Elmer cat# 401628). DNA sequencing was performed in
both forward and reverse directions on an Applied Biosystems, Inc. (ABI), Foster City,
CA, automated Model 377® sequencer. The software used for analysis of the resulting
data was "Sequence Navigator® software" purchased through ABI.

3. RESULTS 

Differences in the nucleic acids of the ten alleles from five individuals were found in
seven locations on the gene. The changes and their positions are found in TABLE III,
below.

TABLE III
PANEL TYPING

AMINO ACID CHANGE    EXON    NUCLEOTIDE CHANGE (SAMPLE)       FREQUENCY
                             1     2     3     4     5

SER(SER) (694)       11E     C/C   C/T   C/T   T/T   T/T      0.4 C, 0.6 T
LEU(LEU) (771)       11F     T/T   C/T   C/T   C/C   C/C      0.4 T, 0.6 C
PRO(LEU) (871)       11G     C/T   C/T   C/T   T/T   T/T      0.3 C, 0.7 T
GLU(GLY) (1038)      11I     A/A   A/G   A/G   G/G   G/G      0.4 A, 0.6 G
LYS(ARG) (1183)      11J     A/A   A/G   A/G   G/G   G/G      0.4 A, 0.6 G
SER(SER) (1436)      13      T/T   T/T   T/C   C/C   C/C      0.5 T, 0.5 C
SER(GLY) (1613)      16      A/A   A/G   A/G   G/G   G/G      0.4 A, 0.6 G
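
The frequencies in TABLE III follow directly from counting alleles across the
ten chromosomes of the five individuals. The Python sketch below reproduces
that counting. The dictionary keys and function name are illustrative
assumptions; the genotypes are those listed in TABLE III.

```python
# Minimal sketch (illustrative only): allele frequencies from the panel genotypes.
from collections import Counter

PANEL = {
    "11E (aa 694)": ["C/C", "C/T", "C/T", "T/T", "T/T"],
    "11F (aa 771)": ["T/T", "C/T", "C/T", "C/C", "C/C"],
    "11G (aa 871)": ["C/T", "C/T", "C/T", "T/T", "T/T"],
    "11I (aa 1038)": ["A/A", "A/G", "A/G", "G/G", "G/G"],
    "11J (aa 1183)": ["A/A", "A/G", "A/G", "G/G", "G/G"],
    "13 (aa 1436)": ["T/T", "T/T", "T/C", "C/C", "C/C"],
    "16 (aa 1613)": ["A/A", "A/G", "A/G", "G/G", "G/G"],
}


def allele_frequencies(genotypes):
    """Count each allele over all chromosomes and return its fraction of the total."""
    counts = Counter(allele for g in genotypes for allele in g.split("/"))
    total = sum(counts.values())
    return {allele: n / total for allele, n in counts.items()}


for site, genotypes in PANEL.items():
    print(site, allele_frequencies(genotypes))
```

With only ten alleles per site, each frequency is necessarily a multiple of
0.1, which is worth keeping in mind when comparing these values with published
frequencies from larger panels.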



Tables 3 and 4 depict one aspect of the invention, sets of at least two alternative 
codon pairs wherein the codon pairs occur in the following frequencies, 
respectively, in a population of individuals free of disease: 

at position 2201, AGC and AGT occur at frequencies from about 35- 
45%, and from about 55-65%, respectively; 

at position 2430, TTG and CTG occur at frequencies from about 35- 
45%, and from about 55-65%, respectively; 

at position 2731, CCG and CTG occur at frequencies from about 25- 
35%, and from about 65-75%, respectively; 

at position 3232, GAA and GGA occur at frequencies from about 35- 
45%, and from about 55-65%, respectively; 

at position 3667, AAA and AGA occur at frequencies from about 35- 
45%, and from about 55-65%, respectively; 

at position 4427, TCT and TCC occur at frequencies from about 45- 
55%, and from about 45-55%, respectively; and 

at position 4956, AGT and GGT occur at frequencies from about 35- 
45%, and from about 55-65%, respectively. 
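
Whether each alternative codon pair changes the encoded amino acid can be
checked against the standard genetic code. The Python sketch below does this
for the seven pairs listed above. The lookup table is a subset of the standard
code limited to the codons involved, and the names used are illustrative
assumptions.

```python
# Minimal sketch (illustrative only): classify each codon pair as silent or missense.
CODON_TO_AA = {  # subset of the standard genetic code
    "AGC": "Ser", "AGT": "Ser", "TCT": "Ser", "TCC": "Ser",
    "TTG": "Leu", "CTG": "Leu", "CCG": "Pro",
    "GAA": "Glu", "GGA": "Gly", "GGT": "Gly",
    "AAA": "Lys", "AGA": "Arg",
}

CODON_PAIRS = {  # nucleotide position -> alternative codons, as listed in the text
    2201: ("AGC", "AGT"),
    2430: ("TTG", "CTG"),
    2731: ("CCG", "CTG"),
    3232: ("GAA", "GGA"),
    3667: ("AAA", "AGA"),
    4427: ("TCT", "TCC"),
    4956: ("AGT", "GGT"),
}


def classify(position):
    """Return 'silent' if both codons encode the same amino acid, else 'missense'."""
    first, second = CODON_PAIRS[position]
    return "silent" if CODON_TO_AA[first] == CODON_TO_AA[second] else "missense"


for pos, (first, second) in CODON_PAIRS.items():
    print(pos, f"{first}({CODON_TO_AA[first]})", f"{second}({CODON_TO_AA[second]})", classify(pos))
```

The result agrees with the amino acid changes listed in TABLE III: three of
the seven sites leave the amino acid unchanged and four alter it.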



The data show that for each of the samples, the BRCA1 gene is identical except
in the region of seven polymorphisms. These polymorphic regions, together with their
locations, the amino acid groups of each codon, the frequency of their occurrence and
the amino acid coded for by each codon are found in TABLE IV below.

TABLE IV

CODON AND BASE CHANGES IN SEVEN POLYMORPHIC SITES OF BRCA1 GENE

SAMPLE       BASE      POSITION     EXON    CODON       AA         PUBLISHED       FREQUENCY
NAME         CHANGE    nt/aa                CHANGE      CHANGE     FREQUENCY(2)    IN THIS STUDY

2,3,4,5      C-T       2201/694     11E     AGC(AGT)    SER-SER    UNPUBLISHED     C=40%
2,3,4,5      T-C       2430/771     11F     TTG(CTG)    LEU-LEU    T=67%(13)       T=40%
1,2,3,4,5    C-T       2731/871     11G     CCG(CTG)    PRO-LEU    C=34%(12)       C=30%
2,3,4,5      A-G       3232/1038    11I     GAA(GGA)    GLU-GLY    A=67%(13)       A=40%
2,3,4,5      A-G       3667/1183    11J     AAA(AGA)    LYS-ARG    A=68%(12)       A=40%
3,4,5        T-C       4427/1436    13      TCT(TCC)    SER-SER    T=67%(12)       T=50%
2,3,4,5      A-G       4956/1613    16      AGT(GGT)    SER-GLY    A=67%(12)       A=40%

(2) Reference numbers correspond to the Table of References below.
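
The nt/aa pairs in TABLE IV are mutually consistent with a coding sequence
whose first codon starts at nucleotide 120. That offset is an inference from
the tabulated pairs, not a value stated in the text, so the Python sketch
below should be read as an illustration under that assumption.

```python
# Minimal sketch (illustrative only): map a nucleotide position to its codon number.
CDS_START_NT = 120  # assumption inferred from the nt/aa pairs in TABLE IV


def codon_number(nt_position, cds_start=CDS_START_NT):
    """Return the 1-based codon (amino acid) number containing a nucleotide."""
    if nt_position < cds_start:
        raise ValueError("position lies upstream of the assumed coding start")
    return (nt_position - cds_start) // 3 + 1


def position_in_codon(nt_position, cds_start=CDS_START_NT):
    """Return 1, 2 or 3 for the first, second or third base of the codon."""
    return (nt_position - cds_start) % 3 + 1


TABLE_IV_PAIRS = [(2201, 694), (2430, 771), (2731, 871), (3232, 1038),
                  (3667, 1183), (4427, 1436), (4956, 1613)]

for nt, aa in TABLE_IV_PAIRS:
    print(nt, codon_number(nt), position_in_codon(nt), "table:", aa)
```

A check of this kind is a convenient way to catch transcription errors when a
position is reported in both nucleotide and amino acid coordinates.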

EXAMPLE 2

Determination Of A Normal Individual Using BRCA1(omi1) And The Seven Polymorphisms
For Reference

A person skilled in the art of genetic susceptibility testing will find the present
invention useful for:

a) identifying individuals having a normal BRCA1 gene, who therefore have no
elevated genetic susceptibility to breast or ovarian cancer from a BRCA1
mutation;

b) avoiding misinterpretation of polymorphisms found in the
BRCA1 gene.

Sequencing is carried out as in EXAMPLE 1 using a blood sample from the patient in
question. However, a BRCA1(omi) sequence is used for reference and the polymorphic
sites are compared to the nucleic acid sequences listed above for codons at each
polymorphic site. A normal sample is one which compares to a BRCA1(omi) sequence and
contains one of the base variations which occur at each of the polymorphic sites. The
codons which occur at each of the polymorphic sites are paired here for reference.

• AGC and AGT at position 2201,
• TTG and CTG at position 2430,
• CCG and CTG at position 2731,
• GAA and GGA at position 3232,
• AAA and AGA at position 3667,
• TCT and TCC at position 4427, and
• AGT and GGT at position 4956.

The availability of these polymorphic pairs provides added assurance that one skilled in
the art can correctly interpret the polymorphic variations without mistaking a
variation for a mutation.

Exon 11 of the BRCA1 gene is subjected to direct dideoxy sequence analysis by
asymmetric amplification using the polymerase chain reaction (PCR) to generate a
single stranded product amplified from this DNA sample. Shuldiner, et al., Handbook
of Techniques in Endocrine Research, p. 457-486, DePablo, F., Scanes, C., eds., Academic
Press, Inc., 1993. Fluorescent dye is attached for automated sequencing using the Taq
Dye Terminator® Kit (Perkin-Elmer cat# 401628). DNA sequencing is performed in
both forward and reverse directions on an Applied Biosystems, Inc. (ABI) automated
Model 377® sequencer. The software used for analysis of the resulting data is "Sequence
Navigator® software" purchased through ABI.

1. Polymerase Chain Reaction (PCR) Amplification

Genomic DNA (100 nanograms) extracted from white blood cells of the subject is
amplified in a final volume of 25 microliters containing 1 microliter (100 nanograms)
genomic DNA, 2.5 microliters 10X PCR buffer (100 mM Tris, pH 8.3, 500 mM KCl, 1.2
mM MgCl2), 2.5 microliters 10X dNTP mix (2 mM each nucleotide), 2.5 microliters
forward primer (BRCA1-11K-F, 10 micromolar solution), 2.5 microliters reverse primer
(BRCA1-11K-R, 10 micromolar solution), 1 microliter Taq polymerase (5 units), and
13 microliters of water.

The PCR primers used to amplify a patient's sample BRCA1 gene are listed in Table II.
The primers were synthesized on a DNA/RNA Model 394® Synthesizer. Thirty-five
cycles of amplification are performed, each consisting of denaturing (95°C; 30
seconds), annealing (55°C; 1 minute), and extension (72°C; 90 seconds), except during
the first cycle in which the denaturing time is increased to 5 minutes, and during the
last cycle in which the extension time is increased to 5 minutes.

PCR products are purified using Qia-quick® PCR purification kits (Qiagen, cat# 28104;
Chatsworth, CA). Yield and purity of the PCR product are determined
spectrophotometrically at OD260 on a Beckman DU 650 spectrophotometer.

2. Dideoxy Sequence Analysis

Fluorescent dye is attached to PCR products for automated sequencing using the Taq
Dye Terminator® Kit (Perkin-Elmer cat# 401628). DNA sequencing is performed in
both forward and reverse directions on an Applied Biosystems, Inc. (ABI), Foster City,
CA, automated Model 377® sequencer. The software used for analysis of the resulting
data is "Sequence Navigator® software" purchased through ABI. The BRCA1(omi1) SEQ.
ID. NO.: 1 sequence is entered into the Sequence Navigator® software as the standard for
comparison. The Sequence Navigator® software compares the sample sequence to the
BRCA1(omi1) SEQ. ID. NO.: 1 standard, base by base. The Sequence Navigator® software
highlights all differences between the BRCA1(omi1) SEQ. ID. NO.: 1 DNA sequence and
the patient's sample sequence.
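
The base-by-base comparison against a standard can be pictured with the
following Python sketch. It is not the Sequence Navigator® algorithm, whose
internals are not described here; it assumes the two sequences are already
aligned and of equal length, and the function name and toy sequences are
hypothetical.

```python
# Minimal sketch (illustrative only): list base-by-base differences from a standard.
def base_differences(standard, sample):
    """Return (1-based position, standard base, sample base) for every mismatch.

    Assumes the sequences are pre-aligned and of equal length.
    """
    if len(standard) != len(sample):
        raise ValueError("sequences must be aligned to the same length")
    return [
        (index + 1, ref, obs)
        for index, (ref, obs) in enumerate(zip(standard.upper(), sample.upper()))
        if ref != obs
    ]


# Toy sequences for illustration, not taken from the BRCA1 sequence listing.
print(base_differences("AGCTTGCCG", "AGTTTGCTG"))
```

Each reported tuple corresponds to one highlighted difference that a reviewer
would then inspect on the chromatogram.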

A first technologist checks the computerized results by comparing visually the
BRCA1(omi1) SEQ. ID. NO.: 1 standard against the patient's sample, and again highlights
any differences between the standard and the sample. The first primary technologist
then interprets the sequence variations at each position along the sequence.
Chromatograms from each sequence variation are generated by the Sequence
Navigator® software and printed on a color printer. The peaks are interpreted by the
first primary technologist and a second primary technologist. A secondary technologist
then reviews the chromatograms. The results are finally interpreted by a geneticist. In
each instance, a variation is compared to known polymorphisms for position and base
change. If the sample BRCA1 sequence matches the BRCA1(omi1) SEQ. ID. NO.: 1
standard, with only variations within the known list of polymorphisms, it is
interpreted as a normal gene sequence.
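
The final comparison of each variation to the known polymorphisms "for
position and base change" can be sketched in Python as below. The positions
and base changes come from TABLE IV; the data structure, function name, and
the two result labels are illustrative assumptions and are no substitute for
the multi-person review described above.

```python
# Minimal sketch (illustrative only): screen variations against the seven known sites.
KNOWN_POLYMORPHISMS = {  # nucleotide position -> bases observed at that site
    2201: {"C", "T"},
    2430: {"T", "C"},
    2731: {"C", "T"},
    3232: {"A", "G"},
    3667: {"A", "G"},
    4427: {"T", "C"},
    4956: {"A", "G"},
}


def interpret(variations):
    """Split variations into explained and unexplained and return a summary label.

    `variations` is a list of (position, standard base, sample base) tuples.
    """
    unexplained = [
        (pos, ref, obs)
        for pos, ref, obs in variations
        if not {ref, obs} <= KNOWN_POLYMORPHISMS.get(pos, set())
    ]
    label = "known polymorphisms only" if not unexplained else "unexplained variation"
    return label, unexplained


print(interpret([(2201, "C", "T"), (3667, "A", "G")]))
print(interpret([(2201, "C", "G")]))
```

A variation is accepted only when both its position and its base change match
a listed polymorphism; a new base at a known position is still flagged.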

EXAMPLE 3

DETERMINING THE ABSENCE OF A MUTATION IN THE BRCA1 GENE USING
BRCA1(omi1) AND SEVEN POLYMORPHISMS FOR REFERENCE

A person skilled in the art of genetic susceptibility testing will find the present
invention useful for determining the presence of a known or previously unknown
mutation in the BRCA1 gene. A list of mutations of BRCA1 is publicly available in the
Breast Cancer Information Core at:

http://www.nchgr.nih.gov/dir/lab_transfer/bic. This data site became publicly
available on November 1, 1995. Friend, S. et al., Nature Genetics 11:238 (1995).

Sequencing is carried out as in EXAMPLE 1 using a blood sample from the patient in
question. However, a BRCA1(omi2) sequence is used for reference and polymorphic sites
are compared to the nucleic acid sequences listed above for codons at each polymorphic
site. A sample is one which compares to the BRCA1(omi2) SEQ. ID. NO.: 3 sequence and
contains one of the base variations which occur at each of the polymorphic sites. The
codons which occur at each of the polymorphic sites are paired here for reference.
• AGC and AGT at position 2201,
• TTG and CTG at position 2430,
• CCG and CTG at position 2731,
• GAA and GGA at position 3232,
• AAA and AGA at position 3667,
• TCT and TCC at position 4427, and
• AGT and GGT at position 4956.

The availability of these polymorphic pairs provides added assurance that one skilled in 
the art can correctly interpret the polymorphic variations without mistaking a 
variation for a mutation. 
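
The comparison against the polymorphic pairs can be pictured as a simple lookup. The following is only an illustrative sketch: the table layout and the function name classify_variation are hypothetical, and the pair at position 2201 is read here as AGC/AGT. Only the positions and codons come from the list above.

```python
# Illustrative sketch: names and structure are assumptions, not part of the
# described procedure. Codon pairs and positions are those listed above
# (position 2201 is read as AGC/AGT).
KNOWN_POLYMORPHISMS = {
    2201: {"AGC", "AGT"},
    2430: {"TTG", "CTG"},
    2731: {"CCG", "CTG"},
    3232: {"GAA", "GGA"},
    3667: {"AAA", "AGA"},
    4427: {"TCT", "TCC"},
    4956: {"AGT", "GGT"},
}


def classify_variation(position, codon):
    """Report whether a codon seen at a position is one of the listed pair."""
    allowed = KNOWN_POLYMORPHISMS.get(position)
    if allowed is not None and codon.upper() in allowed:
        return "known polymorphism"
    return "unlisted variation: review as possible mutation"


print(classify_variation(2731, "CTG"))
print(classify_variation(2731, "CAG"))
```

A variation that falls outside the table is not automatically a mutation; in the procedure described it still passes to the technologists and the geneticist for interpretation.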

Exon 11 of the BRCA1 gene is subjected to direct dideoxy sequence analysis by
asymmetric amplification using the polymerase chain reaction (PCR) to generate a
single stranded product amplified from this DNA sample. Shuldiner, et al., Handbook
of Techniques in Endocrine Research, p. 457-486, DePablo, F., Scanes, C., eds., Academic
Press, Inc., 1993. Fluorescent dye is attached for automated sequencing using the Taq
Dye Terminator® Kit (Perkin-Elmer cat# 401628). DNA sequencing is performed in
both forward and reverse directions on an Applied Biosystems, Inc. (ABI) automated
Model 377® sequencer. The software used for analysis of the resulting data is "Sequence
Navigator® software" purchased through ABI.

1. Polymerase Chain Reaction (PCR) Amplification

Genomic DNA (100 nanograms) extracted from white blood cells of the subject is
amplified in a final volume of 25 microliters containing 1 microliter (100 nanograms)
genomic DNA, 2.5 microliters 10X PCR buffer (100 mM Tris, pH 8.3, 500 mM KCl, 1.2
mM MgCl2), 2.5 microliters 10X dNTP mix (2 mM each nucleotide), 2.5 microliters
forward primer (BRCA1-11K-F, 10 micromolar solution), 2.5 microliters reverse primer
(BRCA1-11K-R, 10 micromolar solution), 1 microliter Taq polymerase (5 units), and
13 microliters of water.
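
As a quick arithmetic check on the reaction described above, the listed volumes can be summed and the final concentrations of the diluted stocks worked out. This is a minimal sketch; the dictionary keys and the helper final_concentration are illustrative names only.

```python
# Volumes in microliters, taken from the reaction described above.
components = {
    "genomic DNA (100 ng)": 1.0,
    "10X PCR buffer": 2.5,
    "10X dNTP mix": 2.5,
    "forward primer (10 micromolar)": 2.5,
    "reverse primer (10 micromolar)": 2.5,
    "Taq polymerase (5 units)": 1.0,
    "water": 13.0,
}
FINAL_VOLUME_UL = 25.0


def final_concentration(stock, volume_ul, total_ul=FINAL_VOLUME_UL):
    """Dilution formula: stock concentration times (volume added / total volume)."""
    return stock * volume_ul / total_ul


total_ul = sum(components.values())
primer_final_um = final_concentration(10.0, 2.5)  # each primer, micromolar
dntp_final_mm = final_concentration(2.0, 2.5)     # each nucleotide, millimolar

print(f"total volume: {total_ul} microliters")
print(f"each primer: {primer_final_um} micromolar")
print(f"each dNTP: {dntp_final_mm} mM")
```

The volumes add up to the stated 25 microliters, and each 10X stock is diluted tenfold in the final reaction.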

The PCR primers used to amplify a patient's sample BRCA1 gene are listed in Table 11.
The primers were synthesized on a DNA/RNA Model 394® Synthesizer. Thirty-five
cycles of amplification are performed, each consisting of denaturing (95°C; 30
seconds), annealing (55°C; 1 minute), and extension (72°C; 90 seconds), except during
the first cycle in which the denaturing time is increased to 5 minutes, and during the
last cycle in which the extension time is increased to 5 minutes.
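
The cycling schedule just described can be written out step by step. The sketch below is only an illustration of that schedule; the function name cycling_program and the tuple layout are assumptions, and ramp times between temperatures are ignored.

```python
def cycling_program(cycles=35):
    """Build (cycle, step, temperature in C, seconds) tuples for the schedule."""
    steps = []
    for cycle in range(1, cycles + 1):
        # First cycle: denaturing extended to 5 minutes.
        denature_s = 300 if cycle == 1 else 30
        # Last cycle: extension extended to 5 minutes.
        extend_s = 300 if cycle == cycles else 90
        steps.append((cycle, "denature", 95, denature_s))
        steps.append((cycle, "anneal", 55, 60))
        steps.append((cycle, "extend", 72, extend_s))
    return steps


program = cycling_program()
hold_seconds = sum(seconds for _, _, _, seconds in program)
print(f"{len(program)} steps, {hold_seconds / 60:.0f} minutes of hold time")
```

Under these assumptions the run amounts to a little under two hours of hold time, before any instrument ramping is counted.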

PCR products are purified using Qia-quick® PCR purification kits (Qiagen, cat#
28104; Chatsworth, CA). Yield and purity of the PCR product are determined
spectrophotometrically at OD260 on a Beckman DU 650 spectrophotometer.

2. Dideoxy Sequence Analysis 

Fluorescent dye is attached to PCR products for automated sequencing using the Taq
Dye Terminator® Kit (Perkin-Elmer cat# 401628). DNA sequencing is performed in
both forward and reverse directions on an Applied Biosystems, Inc. (ABI) Foster City,
CA., automated Model 377® sequencer. The software used for analysis of the resulting
data is "Sequence Navigator® software" purchased through ABI. The BRCA1(omi2) SEQ.
ID. NO.: 3 sequence is entered into the Sequence Navigator® software as the Standard
for comparison. The Sequence Navigator® software compares the sample sequence to
the BRCA1(omi2) SEQ. ID. NO.: 3 standard, base by base. The Sequence Navigator®
software highlights all differences between the BRCA1(omi2) SEQ. ID. NO.: 3 DNA
sequence and the patient's sample sequence.
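
The base-by-base comparison against a standard can be illustrated with a few lines of code. This is not the Sequence Navigator® software, whose internals are not described here; it is a minimal sketch that assumes the two reads are already aligned and of equal length, and the sequences shown are toy fragments rather than SEQ. ID. NO.: 3.

```python
def compare_to_standard(standard, sample):
    """Return (1-based position, standard base, sample base) for each mismatch.

    Assumes the sample is already aligned to the standard, with equal length.
    """
    if len(standard) != len(sample):
        raise ValueError("sequences must be aligned to the same length")
    pairs = zip(standard.upper(), sample.upper())
    return [(i, s, p) for i, (s, p) in enumerate(pairs, start=1) if s != p]


standard = "GCAAAAGCGTCCAGAAAGGA"  # toy standard fragment
sample = "GCAAAAGCGTCCAGGAAGGA"    # toy sample with one changed base

for position, std_base, smp_base in compare_to_standard(standard, sample):
    print(f"position {position}: standard {std_base}, sample {smp_base}")
```

Each reported difference would then be checked against the chromatogram and the list of known polymorphisms, as described in the following paragraph.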

A first technologist checks the computerized results by comparing visually the
BRCA1(omi2) SEQ. ID. NO.: 3 standard against the patient's sample, and again highlights
any differences between the standard and the sample. The first primary technologist
then interprets the sequence variations at each position along the sequence.
Chromatograms from each sequence variation are generated by the Sequence
Navigator® software and printed on a color printer. The peaks are interpreted by the
first primary technologist and also by a second primary technologist. A secondary
technologist then reviews the chromatograms. The results are finally interpreted by a
geneticist. In each instance, a variation is compared to known polymorphisms for
position and base change. If the sample BRCA1 sequence matches the BRCA1(omi2) SEQ.
ID. NO.: 3 standard, with only variations within the known list of polymorphisms, it is
interpreted as a gene sequence.

EXAMPLE 4

DETERMINING THE PRESENCE OF A MUTATION IN THE BRCA1 GENE USING
BRCA1(omi1) AND SEVEN POLYMORPHISMS FOR REFERENCE

A person skilled in the art of genetic susceptibility testing will find the present
invention useful for determining the presence of a known or previously unknown
mutation in the BRCA1 gene. A list of mutations of BRCA1 is publicly available in the
Breast Cancer Information Core at:

http://www.nchgr.nih.gov/dir/lab_transfer/bic. This data site became publicly
available on November 1, 1995. Friend, S. et al. Nature Genetics 11:238, (1995). In this
example, a mutation in exon 11 is characterized by amplifying the region of the
mutation with a primer which matches the region of the mutation.

Exon 11 of the BRCA1 gene is subjected to direct dideoxy sequence analysis by
asymmetric amplification using the polymerase chain reaction (PCR) to generate a
single stranded product amplified from this DNA sample. Shuldiner, et al., Handbook
of Techniques in Endocrine Research, p. 457-486, DePablo, F., Scanes, C., eds., Academic
Press, Inc., 1993. Fluorescent dye is attached for automated sequencing using the Taq
Dye Terminator® Kit (Perkin-Elmer cat# 401628). DNA sequencing is performed in
both forward and reverse directions on an Applied Biosystems, Inc. (ABI) automated
Model 377® sequencer. The software used for analysis of the resulting data is "Sequence
Navigator® software" purchased through ABI.

1. Polymerase Chain Reaction (PCR) Amplification

Genomic DNA (100 nanograms) extracted from white blood cells of the subject is
amplified in a final volume of 25 microliters containing 1 microliter (100 nanograms)
genomic DNA, 2.5 microliters 10X PCR buffer (100 mM Tris, pH 8.3, 500 mM KCl, 1.2
mM MgCl2), 2.5 microliters 10X dNTP mix (2 mM each nucleotide), 2.5 microliters
forward primer (BRCA1-11K-F, 10 micromolar solution), 2.5 microliters reverse primer
(BRCA1-11K-R, 10 micromolar solution), 1 microliter Taq polymerase (5 units), and
13 microliters of water.

The PCR primers used to amplify segment K of exon 11 (where the mutation is found)
are as follows:

BRCA1-11K-F: 5'-GCA AAA GCG TCC AGA AAG GA-3' SEQ ID NO:69

BRCA1-11K-R: 5'-AGT CTT CCA ATT CAC TGC AC-3' SEQ ID NO:70

The primers are synthesized on a DNA/RNA Model 394® Synthesizer.
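
For orientation, basic properties of the two primers can be computed directly from their sequences. The sketch below is illustrative only: it reads the reverse primer as 5'-AGT CTT CCA ATT CAC TGC AC-3', the function name primer_stats is hypothetical, and the melting temperature uses the Wallace rule of thumb, Tm = 2(A+T) + 4(G+C), which is a rough estimate for short oligonucleotides and is not stated in the procedure itself.

```python
def primer_stats(primer):
    """Length, GC content and a Wallace-rule Tm estimate for a short primer."""
    seq = primer.replace(" ", "").upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    return {
        "length": len(seq),
        "gc_percent": 100.0 * gc / len(seq),
        "wallace_tm_c": 2 * at + 4 * gc,  # rough rule of thumb only
    }


forward = primer_stats("GCA AAA GCG TCC AGA AAG GA")  # BRCA1-11K-F
reverse = primer_stats("AGT CTT CCA ATT CAC TGC AC")  # BRCA1-11K-R, as read here

print("forward:", forward)
print("reverse:", reverse)
```

Both primers are 20 bases long, and both estimates sit slightly above the 55°C annealing step used in the cycling conditions.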

Thirty-five cycles are performed, each consisting of denaturing (95°C; 30 seconds),
annealing (55°C; 1 minute), and extension (72°C; 90 seconds), except during the first
cycle in which the denaturing time is increased to 5 minutes, and during the last cycle
in which the extension time is increased to 5 minutes.

PCR products are purified using Qia-quick® PCR purification kits (Qiagen, cat# 28104;
Chatsworth, CA). Yield and purity of the PCR product are determined
spectrophotometrically at OD260 on a Beckman DU 650 spectrophotometer.

2. Dideoxy Sequence Analysis

Fluorescent dye is attached to PCR products for automated sequencing using the Taq
Dye Terminator® Kit (Perkin-Elmer cat# 401628). DNA sequencing is performed in
both forward and reverse directions on an Applied Biosystems, Inc. (ABI) Foster City,
CA., automated Model 377® sequencer. The software used for analysis of the resulting
data is "Sequence Navigator® software" purchased through ABI. The BRCA1(omi2) SEQ.
ID. NO.: 3 sequence is entered into the Sequence Navigator® software as the Standard
for comparison. The Sequence Navigator® software compares the sample sequence to
the BRCA1(omi2) SEQ. ID. NO.: 3 standard, base by base. The Sequence Navigator®
software highlights all differences between the BRCA1(omi2) SEQ. ID. NO.: 3 DNA
sequence and the patient's sample sequence.

A first technologist checks the computerized results by comparing visually the
BRCA1(omi2) SEQ. ID. NO.: 3 standard against the patient's sample, and again highlights
any differences between the standard and the sample. The first primary technologist
then interprets the sequence variations at each position along the sequence.
Chromatograms from each sequence variation are generated by the Sequence
Navigator® software and printed on a color printer. The peaks are interpreted by the
first primary technologist and a second primary technologist. A secondary technologist
then reviews the chromatograms. The results are finally interpreted by a geneticist. In
each instance, a variation is compared to known polymorphisms for position and base
change. Mutations are noted by the length of non-matching variation. Such a lengthy
mismatch pattern occurs with deletions and substitutions.
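
Why a deletion shows up as a lengthy mismatch can be seen in a toy comparison: once bases are missing, every downstream base is read against the wrong position of the standard. The sketch below is an assumption-laden illustration (ungapped comparison, invented 20-base sequence, hypothetical function name longest_mismatch_run), not a description of the software used.

```python
def longest_mismatch_run(standard, sample):
    """Length of the longest run of consecutive mismatches, ungapped."""
    longest = current = 0
    for s, p in zip(standard, sample):  # zip stops at the shorter sequence
        current = current + 1 if s != p else 0
        longest = max(longest, current)
    return longest


standard = "ACGTTGCAGATCCTAGGCTA"                     # toy sequence
sample_sub = standard[:8] + "A" + standard[9:]        # one base changed
sample_del = standard[:8] + standard[10:]             # two bases deleted

print("single substitution:", longest_mismatch_run(standard, sample_sub))
print("two-base deletion:", longest_mismatch_run(standard, sample_del))
```

In this toy case the single changed base gives a run of one, while the two-base deletion produces a mismatch run that continues to the end of the read.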

3. Result 

Using the above PCR amplification and standard fluorescent sequencing
technology, the 3888delGA mutation may be found. The 3888delGA mutation of the
BRCA1 gene lies in segment "K" of exon 11. The DNA sequence results demonstrate
the presence of a two base pair deletion at nucleotides 3888 and 3889 of the published
BRCA1(omi) sequence. This mutation interrupts the reading frame of the BRCA1
transcript, resulting in the appearance of an in-frame terminator (TAG) at codon
position 1265. This mutation is, therefore, predicted to result in a truncated, and most
likely, non-functional protein. The formal name of the mutation will be 3888delGA.
This mutation is named in accordance with the suggested nomenclature for naming
mutations, Baudet, A. et al., Human Mutation 2:245-248, (1993).
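
The way a two-base deletion shifts the reading frame and exposes a premature terminator can be shown on a toy coding sequence. The sequence below is invented for illustration and is not BRCA1; the function names are hypothetical. Only the mechanism (deletion of two bases, then an in-frame TAG) mirrors the result described above.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def first_stop_codon(cds):
    """Return (1-based codon number, codon) of the first in-frame stop, or None."""
    for i in range(0, len(cds) - 2, 3):
        codon = cds[i:i + 3]
        if codon in STOP_CODONS:
            return i // 3 + 1, codon
    return None


def delete_bases(cds, start, count):
    """Delete `count` bases beginning at 1-based position `start`."""
    return cds[:start - 1] + cds[start - 1 + count:]


toy_cds = "ATGGCAGATAGCCTTAAAGCTGTATAA"  # invented: ATG GCA GAT AGC ... TAA
mutant = delete_bases(toy_cds, 7, 2)     # remove the "GA" at positions 7-8

print("normal:", first_stop_codon(toy_cds))
print("mutant:", first_stop_codon(mutant))
```

In the toy sequence the normal reading frame runs to its terminator at the ninth codon, whereas after the deletion a TAG falls in frame at the third codon, giving a truncated product.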

EXAMPLE 5

USE OF THE BRCA1(omi1) GENE THERAPY

The growth of ovarian, breast or prostate cancer can be arrested by increasing the
expression of the BRCA1 gene where inadequate expression of that gene is responsible
for hereditary ovarian, breast and prostate cancer. It has been demonstrated that
transfection of BRCA1 into cancer cells inhibits their growth and reduces
tumorigenesis. Gene therapy is performed on a patient to reduce the size of a tumor.
The LXSN vector is transformed with any of the BRCA1(omi1) SEQ. ID. NO.:1,
BRCA1(omi2) SEQ. ID. NO.:3, or BRCA1(omi3) SEQ. ID. NO.:5 coding region.

Vector 

The LXSN vector is transformed with wildtype BRCA1(omi1) SEQ. ID. NO.:1 coding
sequence. The LXSN-BRCA1(omi1) retroviral expression vector is constructed by cloning
a SalI-linkered BRCA1(omi1) cDNA (nucleotides 1-5711) into the XhoI site of the vector
LXSN. Constructs are confirmed by DNA sequencing. Holt et al. Nature Genetics 12:
298-302 (1996).

Retroviral vectors are manufactured from viral producer cells using serum free and
phenol-red free conditions and tested for sterility, absence of specific pathogens, and 
absence of replication-competent retrovirus by standard assays. Retrovirus is stored 
frozen in aliquots which have been tested. 

Patients receive a complete physical exam, blood, and urine tests to determine 
overall health. They may also have a chest X-ray, electrocardiogram, and appropriate 
radiologic procedures to assess tumor stage. 

Patients with metastatic ovarian cancer are treated with retroviral gene therapy by
infusion of recombinant LXSN-BRCA1(omi1) retroviral vectors into peritoneal sites
containing tumor, between 10^ and 10^10 viral particles per dose. Blood samples are
drawn each day and tested for the presence of retroviral vector by sensitive polymerase
chain reaction (PCR)-based assays. The fluid which is removed is analyzed to
determine:

1. The percentage of cancer cells which are taking up the recombinant
LXSN-BRCA1(omi1) retroviral vector combination. Successful transfer of BRCA1 gene into
cancer cells is shown by both RT-PCR analysis and in situ hybridization.
RT-PCR is performed by the method of Thompson et al. Nature Genetics 9: 444-450
(1995), using primers derived from BRCA1(omi1) SEQ. ID. NO.:1. Cell lysates are
prepared and immunoblotting is performed by the method of Jensen et al. Nature
Genetics 12: 303-308 (1996) and Jensen et al. Biochemistry 31: 10887-10892 (1992).

2. Presence of programmed cell death using ApoTAG® in situ apoptosis detection kit 
(Oncor, Inc., Gaithersburg, Maryland) and DNA analysis. 


3. Measurement of BRCA1 gene expression by slide immunofluorescence or western
blot.

Patients with measurable disease are also evaluated for a clinical response to
LXSN-BRCA1, especially those that do not undergo a palliative intervention immediately
after retroviral vector therapy. Fluid cytology, abdominal girth, CT scans of the abdomen,
and local symptoms are followed.

For other sites of disease, conventional response criteria are used as follows:

1. Complete Response (CR), complete disappearance of all measurable lesions
and of all signs and symptoms of disease for at least 4 weeks.

2. Partial Response (PR), decrease of at least 50% of the sum of the products of the
2 largest perpendicular diameters of all measurable lesions as determined by 2
observations not less than 4 weeks apart. To be considered a PR, no new lesions should
have appeared during this period and none should have increased in size.

3. Stable Disease, less than 25% change in tumor volume from previous
evaluations.

4. Progressive Disease, greater than 25% increase in tumor measurements from
prior evaluations.
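
The arithmetic behind these criteria can be sketched as follows. This is a simplified illustration, not a clinical tool: it assumes the sum of the products of the two largest perpendicular diameters is used as the tumor measurement for every category, it does not model the 4-week confirmation, signs and symptoms, or the requirement that no individual lesion has grown, and the function names are hypothetical. Changes that fall between the listed thresholds are reported as unclassified rather than guessed.

```python
def sum_of_products(lesions):
    """lesions: iterable of (d1, d2), the 2 largest perpendicular diameters."""
    return sum(d1 * d2 for d1, d2 in lesions)


def classify_response(baseline, followup, new_lesions=False):
    """Simplified reading of the four listed response categories."""
    before = sum_of_products(baseline)
    after = sum_of_products(followup)
    if before <= 0:
        raise ValueError("baseline must contain measurable disease")
    change = 100.0 * (after - before) / before
    if after == 0 and not new_lesions:
        return "Complete Response"
    if change <= -50 and not new_lesions:
        return "Partial Response"
    if change > 25:
        return "Progressive Disease"
    if abs(change) < 25:
        return "Stable Disease"
    return "unclassified by the listed criteria"


baseline = [(4.0, 3.0), (2.0, 2.0)]  # sum of products = 16
print(classify_response(baseline, [(2.0, 2.0), (1.0, 1.0)]))
print(classify_response(baseline, [(4.0, 3.0), (2.0, 2.5)]))
print(classify_response(baseline, [(5.0, 4.0), (2.0, 2.0)]))
```

The three follow-up examples correspond to a decrease of more than half, a small change, and an increase of half, respectively.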

The number of doses depends upon the response to treatment.
For further information related to this gene therapy approach see "BRCA1
Retroviral Gene Therapy for Ovarian Cancer", a Human Gene Transfer Protocol: NIH
ORDA Registration #: 9603-149, Jeffrey Holt, JT, M.D. and Carlos L. Arteaga, M.D.

"Breast and Ovarian cancer" is understood by those skilled in the art to
include breast and ovarian cancer in women and also breast and prostate cancer in men.
BRCA1 is associated with genetic susceptibility to inherited breast and ovarian cancer in
women and also breast and prostate cancer in men. Therefore, claims in this document
which recite breast and/or ovarian cancer refer to breast, ovarian and prostate cancers in
men and women. Although the invention has been described with reference to the
presently preferred embodiments, it should be understood that various modifications
can be made without departing from the spirit of the invention. Accordingly, the
invention is limited only by the following claims.